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Abstract 

The intersection graph of a collection of trapezoids with corner points lying on two parallel 
lines is called a trapezoid graph. Using binary indexed tree data structure, we improve 
algorithms for calculating the size and the number of minimum vertex covers (or independent 
sets), as well as the total number of vertex covers, and reduce the time complexity from O(n^) 
to O(nlogn), where n is the number of trapezoids. Furthermore, we present the family of 
counterexamples for recently proposed algorithm with time complexity 0{n^) for calculating 
the maximum cardinality matching in trapezoid graphs. 

Keywords: trapezoid graphs; vertex cover; matching; algorithms; data structures; binary 
indexed tree. 
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1 Introduction 

A trapezoid diagram consists of two horizontal lines and a set of trapezoids with corner points 
lying on these two lines. A graph G = {V, E) is a trapezoid graph when a trapezoid diagram 
exists with trapezoid set T, such that each vertex i ^ V corresponds to a trapezoid T(i) and 
an edge exists {i,j) G if and only if trapezoids T{i) and T{j) intersect within the trapezoid 
diagram. A trapezoid T{i) between these lines has four corner points a(i), c{i) and d{i) - 
which represent the upper left, upper right, lower left and lower right corner points of trapezoid i, 
respectively. No two trapezoids share a common endpoint. 

Ma and Spinrad [2S] showed that trapezoid graphs can be recognized in 0{n?) time, while 
Mertzios and Corneil [23] designed structural trapezoid recognition algorithm based on the vertex 
splitting method in 0{n{m + n)) time, which is easier for implementation. Here n stands for 
the number of vertices and m for the number of edges. Trapezoid graphs are perfect, subclass of 
cocomparability graphs and properly contain both interval graphs and permutation graphs |2U) . 

Trapezoid graphs were first investigated by Corneil and Kamula [9]. These graphs and their 
generalizations were applied in various fields, including modeling channel routing problems in 
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VLSI design [TT] and identifying the optimal chain of non-overlapping fragments in bioinformat- 
ics [1]. Many common graph problems, such as minimum connected dominating sets [29], all-pair 
shortest paths |26j . maximum weighted cliques all cut vertices p£J, chromatic number and 
clique cover |12j . all hinge vertices 0], minimum vertex covers [22] in trapezoid graphs, can be 
solved in polynomial time. For other related problems see [6| [TO l [T8 l [T9]. 

Let G = iy, E) be a simple undirected graph with \V\ = n and \E\ = m. A subset C QV is 
called a vertex cover (VC for short) of G if and only if every edge in E has at least one endpoint 
in C. A vertex cover G is called a minimal vertex cover if and only if no proper subset of C 
is also a vertex cover. The size of a vertex cover is the number of vertices that it contains. A 
vertex cover with minimum size is called a minimum vertex cover. Notably, a minimum VC is 
always a minimal VC, but a minimal VC may not be a minimum VC. 

Given a graph G and a fixed parameter k, determining whether G has a vertex cover of at 
most k vertices is one of the best-known NP complete problems [17] . Many recent studies have 
focused on developing faster algorithms for the VC problem in special classes of graphs. The 
number of vertex covers in a graph is important, particularly because this characteristic typically 
arises in problems related to network reliability [28]. Okamoto et al. |27] proposed 0{n + m) 
time algorithms for counting the numbers of independent sets and maximum independent sets 
in a chordal graph. Lin et al. |20[ |2T] considered interval graphs and obtained efficient linear 
0{n) algorithms for counting the number of VCs, minimal VCs, minimum VCs, and maximum 
minimal VCs in an interval graph. Recently, Lin and Chen [22] presented O(n^) algorithms for 
the same vertex cover properties in a trapezoid graph. 

In graph theory, the notions of vertex cover and independent set are dual to each other. A 
subset I CV is called an independent set of G if and only if every edge in E is incident on no 
more than one vertex in /. 

Theorem 1.1 ([8j) A set G is a minimal (minimum) vertex cover of G if and only ifV — G 
is an maximal (maximum) independent set of G. 

A matching in a graph is a set of edges in which no two edges are adjacent. A single edge 
in a graph is obviously a matching. A maximum matching is a matching with the maximum 
cardinality and the cardinality of the maximum matching is called a matching number. Cur- 
rently, the best known algorithm for the constructing maximum matching in general graphs is 
Edmonds' algorithm I25j. which is based on the alternating paths, blossom and shrinking 
and runs in time OdSli^/jV^). For special classes of graphs, such as interval, chordal and per- 
mutation graphs, more efficient algorithms were designed (see [21 [3 [7] and references therein). 
Ghosh and Pal [T5] presented an efficient algorithm to find the maximum matching in trapezoid 
graphs, which turns out to be not correct. 

The rest of the paper is organized as follows. In Section 2 we introduce the binary indexed 
tree data structure. In Section 3 we design O(nlogn) time dynamic programming algorithm for 
calculating the size of the minimum vertex cover, the number of vertex covers and the number 
of minimum vertex covers, improving the algorithms from ^22j. In Section 4 we present family of 
counterexamples for O(n^) algorithm from [15| for finding the maximum matching in trapezoid 
graphs. 

2 Binary indexed data structure 

The binary indexed tree (BIT) is an efficient data structure introduced by Fenwick [13] for 
maintaining the cumulative frequencies. The BIT was first used to support dynamic arithmetic 
data compression and algorithm coding. 
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Let A be an array of n elements. The binary indexed tree supports the following basic 
operations: 

(i) for given value x and index i, add x to the element A{i), 1 < i < n; 
(ii) for given interval [1, i], find the sum of values 74(1), ^(2), . . . , A{i), 1 < i < n. 

Naive implementation of these operations have complexities 0{1) and 0(n), respectively. We 
can achieve better complexity, if we speed up the second operation which will also affect the first 
operation. Another approach is to maintain all partial sums from ^(1) to A{i) for 1 < z < n. 
This way the operations have complexities 0(n) and 0(1), respectively. 

The main idea of binary indexed tree structure is that sum of elements from the segment 
can be represented as sum of appropriate set of subsegments. The BIT structure is based 
on decomposition of the cumulative sums into segments and the operations to access this data 
structure are based on the binary representation of the index. This way the time complexity for 
both operations will be the same O(logn). We want to construct these subsegments such that 
each element is contained in at most logn subsegments, which means that we have to change the 
partial sums of at most log n subsegments for the first procedure. On the other hand, we want 
to construct these subsegments such that for every I <i <n the subsegment is divided in 
at most logn subsegments. It follows that these subsegments can be [i — 2''(') + where r{i) 
is the position of the last digit 1 (from left to right) in the binary representation of the index i. 
We store the sums of subsegments in the array Tree (see Algorithm 1 and Algorithm 2), and 
the element Tree{i) represents the sum of elements from index i — 2''^*^ + 1 to i. The structure is 
space-efficient in the sense that it needs the same amount of storage as just a simple array of n 
elements. Instead of sum, one can use any distributive function (such as maximum, minimum, 
product). We have the following 

Proposition 2.1 Calculating the sum of the elements from A{1) to A{i), 1 < i < n, and 
updating the element A{i) in the binary indexed tree is performed in O(logn) time. 

The fundamental operation involves calculating a new index by stripping the least significant 
1 from the old index, and repeating this operation until the index is zero. For example, to read 
the cumulative sum ^(1) + ^(2) + . . . + ^(11), we form a sum 

Tree(ll) + Tree{10) + rree(8) = (^(11)) + (^(9) + A(10)) + (a{1) + A{2) + ... + A(8)) . 

An illustrative example is presented in Table 1. 
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Table 1: An example with array A of length n = 16, binary indexed tree Tree and cumulative 
sums. 

Using the fast bitwise AND operator, we can calculate the largest power of 2 dividing the 
integer n simply as r(n) = n AND (— n). 
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Algorithm 1: Updating the binary indexed tree. 
Input: The value value and element index index. 
while index < n do 

Tree[index] = Tree[index] + value; 

index = index + {index and (—index)); 
end 



Algorithm 2: Calculating the cumulative sum. 
Input: The index index. 

Output: The sum of elements j4[2], . . . , A[index]. 
sum = 0; 

while index > do 

sum = sum + Tree[index]; 

index = index — (index and (—index)); 
end 

return sum; 



3 The algorithm for minimum vertex covers 

Let T = {1,2, ... ,n} denote the set of trapezoids in the trapezoid graph G = (V,E). For 
simplicity, the trapezoid in T that corresponds to vertex i in is called trapezoid T(i). Without 
loss of generality, the points on each horizontal line of the trapezoid diagram are labeled with 
distinct integers between 1 and 2n. 

For simplicity and easier implementation, we will consider the maximum independent sets 
(according to Theorem II. ip . At the beginning, we add two dummy trapezoids and n + 1 to 
T, where a(0) = 6(0) = c(0) = d(0) = for the vertex and a(n + 1) = b(n + 1) = c(n + 1) = 
d(n + 1) = 2n + 1 for the vertex n + 1. Trapezoid i lies entirely to the left of trapezoid j, denoted 
by « <C j, if b(i) < a(j) and d(i) < c(j). It follows that ^ is a partial order over the trapezoid 
set T and (T, <C) is a strictly partially ordered set. 

In order to get 0(n log n) algorithms, we will use binary indexed tree data structure for sum 
and maximum functions. 

3.1 The size of maximum independent set and the total number of indepen- 
dent sets 

Arrange all trapezoids by the upper left corner a(i). Let maxAnd(i) denote the size of the 
maximum independent set of T(l), T(2), . . . , T(i — l),T(i) that contain trapezoid T(i). For the 
starting value, we set max-ind(0) = 0. The following recurrent relation holds 

max-ind(i) = max{l + maxJ.nd(k),0 < k < i and T(k) <C T(i)}. 

This produces a simple O(n^) dynamic programming algorithm (similar to [22j). 

To speed up the above algorithm, we will store the values from maxJnd in the binary indexed 
tree cumjmax for maintaining the partial maximums. First initialize each element of cumjmax 
with —1. We need an additional array index, such that index(j) contains the index of the 
trapezoid with left or right coordinate equal to j. We are going to traverse the coordinates from 
to 2n + 1, and let j be the current coordinate. In order to detect which already processed 
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trapezoids are to the left of the current trapezoid T{i), we calculate the value maxjind(i) when 
the current coordinate is the left upper coordinate of a{i) and insert it in the binary indexed 
tree when it is the right upper coordinate b{i). Then we have two cases: 

(i) coordinate j is the upper left coordinate of the trapezoid T{i). We only need to consider 
the trapezoids that have lower right coordinate smaller than c{i). As explained, these 
trapezoids have their corresponding values stored in the segment [l,c(i)] in the binary 
indexed tree and therefore we can calculate max-ind{i) based on the maximum values 
among cumjmax{l), cum_max{2), . . . , cumjrnax{c{i)). 

{a) coordinate j is the upper right coordinate of the trapezoid T{i). In this case, the value 
maxAnd{i) is already calculated and we put this value in the binary indexed tree at the 
position d{i). 

The final solution is max_ind{n + 1) — 1. The pseudo-code is presented in Algorithm 3. 



Algorithm 3: The size of the maximum independent set. 
Input: The trapezoids T and the array index. 

Output: The number of maximum independent sets stored in the array max-ind. 
Initialize binary indexed tree for maximum cum_max; 
for J = to 2n + 1 do 

i = index[j]; 

if a[i] = j then 

I max-ind[i] = Calculate (c[i]) + 1; 
end 

else // b[i] = j 
I \JpdsLte{d[i],maxJnd[i]); 
end 
end 

return maxAnd[n + 1] — 1; 



The set of all VCs and the number of VCs of G are denoted by VC{G) and \VC{G)\, 
respectively. Let Nq{v) be the set of vertices adjacent to v in the graph G. In \^2\ the authors 
proved the following result. 

Lemma 3.1 For a graph G and arbitrary vertex v , it holds 

\VG{G)\ = \VC{G -v)\ + \VG{G -v- Ng{v))\. 

For counting the total number of independent sets, we can use the same method as above. 
The only difference is that we will have sum instead of maximum function in the binary indexed 
tree. Let numJ,nd{i) denote the number of independent sets of trapezoids T(l), T(2), . . . , T{i — 
l),T{i) that contain trapezoid T{i). According to Lemma l3.ll the following recurrent relation 
holds 

numjind{i) = 1 + num_ind{k). 
0<k<i, T(fc)<T(i) 

The total number of independent sets is simply numJ.nd{n + 1) — 1 and time complexity is the 
same O(nlogn). 
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3.2 The number of maximum independent sets 

For counting the number of maximum independent sets, we will need one additional array 
nurri-max-ind, such that numjmax_ind{i) represents the number of maximum independent sets 
among trapezoids T(1),T(2), . . . ,T(i — l),T{i) such that T{i) belongs to the independent set. 
After running the algorithm for calculating the array maxJnd, for each 1 < i < n we need to sum 
the number of independent sets of all indices j, such that T{j) <^ T{i) and maxJnd{j) + 1 = 
max_ind(i) in order to get the number of independent sets with maximum cardinality. The 
pseudo-code of the algorithm for counting the number of maximum independent sets from [22] 
is presented in Algorithm 4. 



Algorithm 4: The number of maximum independent sets. 
Input: The trapezoids T and the array maxjind. 

Output: The number of maximum independent sets stored in the array numjmaxJnd. 
numjnnax_ind\fS\ = 1; 
for z = 1 to n + 1 do 
numjinaxjind[i] = 0; 
for J = to z — 1 do 

if {maxjind[j] + 1 = max_ind[i]) and {a[j] > b[i]) and {c[j] > d[i\) then 
I num_maxJnd[i] = num_maxJnd[i] + num_maxJnd[j]; 
end 
end 
end 

return num_maxjind[n + 1] — 1; 



We will use again binary indexed trees for designing 0{n log n) algorithm. The main problem 
is how to manipulate the partial sums. Namely, for each trapezoid T(i), we need to sum the 
values numjmaxJ,nd{j) of those trapezoids T(j) which are entirely on the left of T{i) with 
additional condition maxJ,nd{j) + 1 = max_ind{i). 

This can be done by considering all pairs {k, A: + 1) of two neighboring values 1 < k < 
maxjind{n -\- 1). Using two additional arrays for the coordinates and trapezoid indices, we can 
traverse trapezoids T{i) from left to right, such that max-ind{i) = k or maxJ,nd{i) = A; + 1. 
First, for each 1 < A; < maxJ,nd{n + 1) one needs to carefully preprocess all trapezoids and 
store the indices and coordinates of all trapezoids T{i) with max_ind{i) = k in order to keep the 
memory limit 0{n). Therefore, all trapezoids will be stored in the dynamic array of arrays S, 
such that S{k) contains all trapezoids T{i) with maxAnd{i) = k. Then we fill the binary indexed 
tree with the values for trapezoids with maxjind{i) = k, and calculate the value numjmaxJnd{i) 
for trapezoids with maxjind{i) = A: + 1, as before. Instead of resetting the binary indexed tree 
for each k, we can again traverse the trapezoids with maxJnd{i) = k and update the values of 
BIT to 0. 

Since every trapezoid will be added and removed exactly once from the binary indexed tree 
data structure, the total time complexity is 0(n log n). This reusing of the data structure is 
novel approach to the best of our knowledge and makes this problem very interesting. 

We conclude this section by summing the results in the following theorem. 

Theorem 3.2 The proposed algorithms calculate the size and the number of maximum indepen- 
dent sets, as well as the total number of independent sets in time O(nlogn) of trapezoid graph 
G with n vertices. 
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One can easily extend this algorithm for efficient construction the independence polynomial 
of a trapezoid graph 

a(G) 

I{G,x) = skx^, 

k=0 

where Sk is the number of independent sets of cardinality k and a{G) is the independence number 
of G. 

4 The algorithm for the maximum matching 

Ghosh and Pal [15] presented an efficient algorithm for finding the maximum matching in trape- 
zoid graphs. The proposed algorithm takes O(n^) time and 0{n + m) space. They defined the 
right spread f{i) of a trapezoid T(i) as the maximum max{6(z), d{i)}. Let M be the set of edges 
which form a matching of the graph G. Deletion of a vertex from the graph means deletion of 
that vertex and its adjacent edges. 

The algorithm is designed as follows. Calculate the right spread f{i) and arrange all trape- 
zoids by / in ascending order for 1 < i < n in linear time 0{n). Select the trapezoid i with the 
minimum value of the right spread f{i)- After that, select the second minimum value /(j). If 
trapezoids T[i) and T{j) are adjacent, then put the edge in the maximum matching M, 
remove the vertices i and j from the graph G and mark the corresponding elements of the array 
/. If E[G) then continue selecting the next minimum elements f{j') from the array / 
until the trapezoids i and j' are adjacent. In that case remove the vertices i and j', put the 
edge {i^j') in the maximum matching M and mark the corresponding elements of /; otherwise 
remove the trapezoid i from the graph G since it cannot be matched. Continue this procedure 
as long as > 1. Since for each trapezoid we potentially traverse all trapezoids with larger 

values /(i), the time complexity of the proposed algorithm is O(n^). 




Figure 1: The minimal counterexample for the matching algorithm. 

Obviously, the algorithm works for any set of n = 3 trapezoids . In the configuration on 
Figure 2, there are four trapezoids and only trapezoids r(3) and r(4) are not adjacent. The 
trapezoids are ordered such that /(I) < /(2) < /(3) < /(4). By proposed algorithm in [15], we 
will match the trapezoids 1 and 2 and get the maximal matching of cardinality 1. But obviously, 
the size of the maximum matching is 2 by pairing the trapezoids (1,3) and (2,4). Therefore, 
this algorithm is not correct and gives only an approximation for the maximum matching. One 
can easily construct a family of counterexamples based on this minimal example by adding 
trapezoids to the left and right. 

We leave as an open problem to design more efficient algorithm for finding maximum match- 
ing in trapezoid graphs. 
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